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I. INTRODUCTION 

Few realistic agent-based models of financial markets 
can be understood in depth. Among these, minority 
games (MGs) [1, 2] are perhaps the most studied at a fun- 
damental physical level, especially through a systematic 
use of spin-glass techniques [3-7] . The inclusion of many 
important aspects of market dynamics in the standard 
MG setup, however, often leads to considerable technical 
difficulties and raises new challenges for statistical me- 
chanics. A particularly important modification concerns 
the memory of agents. In the original game, the learning 
dynamics on which traders base their strategic decisions 
is such that they remember all their past payoffs irre- 
spective of how far in time they occurred. It is however 
reasonable to think that real traders tend to base their 
choices only on the most recent events. This rather natu- 
ral extension was originally introduced in [8] for a model 
in which agents play the MG strategically. For the pur- 
pose of modeling financial markets, the relevant situation 
is instead that of price-taking, or naive, agents. In this 
case, it has been argued that finite score memory gives 
rise to a surprisingly rich dynamical phenomenology [9]. 
In this paper, we will analyze such models in detail. 

There are of course several ways to introduce a finite 
memory in the MG. A conceptually simple one is to fix 
a time window M (the 'score memory') during which 
agents keep exact track of their scores [10, 11]. The main 
advantage is that the game becomes Markovian of order 
M. This situation can be handled numerically for reason- 
ably small M. Here, we are interested in the case in which 
M is of the order of the number of traders N, which is 
supposed to be very large (ultimately, the limit N — > oo 
will be considered). For the sake of simplicity, we choose 
to implement the situation in which scores are exponen- 
tially damped in time, which requires only a minor mod- 
ification of the original equations. Furthermore, we shall 
focus on the grand-canonical MG (GCMG) [12], which 
is known to produce market-like fluctuation phenomena 
and whose properties have been shown to be extremely 
sensitive to the introduction of a finite memory [9] . Both 



the 'on-line' and 'batch' versions of the model will be 
addressed. The two situations differ by the timescales 
over which agents update their status: in the former, the 
updating takes place at every time step; in the latter, it 
occurs roughly once every P time steps, with P — O(N). 
In particular, following [13], we use a parameter that in- 
terpolates between the 'on-line' and 'batch' models and 
study how the resulting fluctuation phenomena are af- 
fected by changes of updating timescales. 

We shall proceed by defining the models (Sec. II) and 
exploring, via computer experiments, their behaviour 
(Sec. III). We will focus especially on the emergence and 
parameter-dependence of 'stylized facts', peculiar statis- 
tical regularities that are empirically observed in financial 
markets. In Sections IV and V we will characterize ana- 
lytically the stationary states of the model with infinite 
and finite memory, respectively. We use static replica- 
based minimization techniques in order to compute the 
properties of the on-line game with infinite score mem- 
ory. For the batch game, one can resort to dynamical 
mean-field theory. The resulting theory is exactly solv- 
able in the case of infinite memory. For finite-memory 
games, we have to resort a Monte Carlo scheme, known 
as Eissfeller-Opper method [14], to extract the steady 
state from the dynamical mean-field equations. Finally, 
we formulate our concluding remarks in Sec. VI. For the 
sake of completeness, we add an Appendix in which the 
effects of finite score memory in the standard MG are 
discussed. 



II. DEFINITION OF THE MODEL 

In a minority game, at each time step t, N traders 
are faced with two choices: to buy or to sell; those 
who happen to be in the minority win. The game is 
repeated, and the traders' actions are determined by a 
simple re-inforcement learning. Each agent i has his own 
fixed trading strategy = {af} that specifies an ac- 
tion £ { — 1, 1} for each of the // = l,...,P states 
of the world. Each component of every strategy is ran- 



2 



domly drawn from { — 1, 1} with uniform probability be- 
fore the beginning of the game. The adaptation abilities 
of agent i are limited to choosing whether to participate 
or to withdraw from the market, denoted respectively by 
rii(t) = 1 and nj(i) = 0. At time t, the state of the world 
/j,(t) is drawn equiprobably from {1, . . . , P}. Agent i sets 
his rii(t) according to the sign of his strategy score, de- 
noted by Ui(t). In particular, he plays nj(i)af where 
rii(t) = ®[Ui(t)] and G(x) is the Heaviside step function. 
The total excess demand A(t) at time t, namely the nu- 
merical difference between buyers and sellers, is 

A(t) =$>(*)<#<*>. (1) 

i=l 

All the agents then update their score according to 

Vi(t + 1) = (l - W(t) - ^A(t) - | (2) 

with Xi > a constant. The (1 — Xi/P) term is re- 
sponsible for finite score memory: if Aj = 0, the agent 
has infinite score memory, as usually assumed in MGs, 
while for A, > he has an exponentially damped mem- 
ory of his past scores. More precisely, the number of time 
steps needed to forget a fraction /, of his/her payoff is 
ln(l - /j)/ln(l - Xi/P); for Xi/P < 1, it is proportional 
to P/Xi. The payoff - af (t) A(t) is that of a Minority 
Game. The last term sets a benchmark that agent i 
has to beat in order to participate in the market. For in- 
stance, ej can be thought of as the interest of a risk-free 
account (see [12, 15] for more details). 

Although our analysis can be extended easily to a more 
heterogeneous case, we assume, for the sake of simplic- 
ity, that Xi — A and that there are two groups of agents: 
those with ej = — oo, referred to as 'producers', who al- 
ways take part in the market, and the rest, who have 
f-i = e finite and are called 'speculators' [16]. Traders 
with e > (resp. e < 0) are risk-averse (resp. risk- 
prone), i.e., they have an incentive to stay out of (resp. 
enter) the market. We denote by N s and N p the num- 
ber of speculators and producers, respectively. The pro- 
ducers, being deterministic, inject information into the 
market, that the speculators try to exploit. This setup, 
which defines an ecology of market participants, has been 
introduced in [15] as the simplest tractable interacting 
agents model able to reproduce the 'stylized facts' of fi- 
nancial markets [12]. In the statistical mechanics ap- 
proach, one is interested in the limit of large systems, 
in which P,N s ,N p — > oo keeping the reduced number 
of agents n s = linip^oo N s /P and n p — linip^oo N p /P 
fixed. 

Up to now, agents update their variables rii(t) at each 
time step. It is natural in financial markets to assume 
that traders prefer not to change their strategy every 
time step in order to avoid over-reacting, and also be- 
cause estimating the performance of a strategy needs 
some time. This can be approximated in our model by 



allowing agents to change their variable rii every T time 
steps [13]. For the sake of simplicity, we assume that 
agents perform the updates synchronously. If T » P, 
the score update between t and t + T is essentially an 
average of score increases over all the states of the world. 
In this limit, defining t' as t/T, one can rewrite (2) as 

N 

yi (t' + 1) = (1 - Xi) yi (t') - Jij^jit') - ae t (3) 

with a = P/N — (n s + rip)^ 1 and quenched random 
couplings Jij given by 

Because this dynamics is equivalent to enumerating all 
/i e {1, . . . ,P}, we assumed that all the states of the 
world always occur between t' and t' + 1. The neural 
network literature refers to models with T = 1 as 'on- 
line', whereas models such as (3) are called 'batch'. The 
parameter T allows the interpolation between the former 
and the latter. 

Batch minority games were introduced in [17] and 
solved later exactly with generating functionals [5] , an ex- 
act dynamical method. The stationary states of (2) and 
(3) are, strictly speaking, different even in the thermody- 
namic limit. This is because agents in batch games have a 
longer auto-correlation: for instance, denoting = (rij) 
the time average of rij(i) in the stationary state, one has 
(mrij} — <j)i4>j in on-line games [3, 7], but not in batch 
games [7]. 

III. STYLIZED FACTS 

The connection between minority game's outcome A 
and real prices comes from relating A to the 'excess de- 
mand', and linking the evolution of the price p(t) to it 
via 

logp(i + l) = logp(t) + ^ (5) 

where L is a constant, the 'liquidity', that will be here- 
after fixed to 1 [18, 19]. While the original MG revealed 
insightful relationships between price fluctuations and 
predictability, it fails to reproduce the empirically ob- 
served market-like behaviour, in particular the so-called 
'stylized facts'. As a consequence, GCMGs were intro- 
duced to better mimic market dynamics. They are able 
to produce stylized facts such as fat-tailed price return 
distributions P(A) oc A~P with (i ~ 3.5 and volatility 
clustering {A(t) 2 A(t + t) 2 ) cx t~ 7 with 7 ~ 0.3; these 
exponents are close to those measured in real markets. 
They are however unable to reproduce well documented 
over-diffusive price behavior [20, 21], because the MG 
induces a mean-reverting process. 
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FIG. 1: Return as a function of time for the same realisation 
of a GCMG with biased initial strategy scores and A = 
(black lines), and A = 0.003 (red lines). System set in the 
critical window, P = 25, n s = 40, e — 0.01 



The presence of stylized facts in the GCMG with A = 
was linked to a too small signal-to-noise ratio, suggesting 
that marginal efficiency is a necessary condition for the 
existence of stylized facts [12]. In other words, in infinite 
systems, stylized facts only occur at a the phase transi- 
tion, whereas in hnite systems, critical- like phenomenol- 
ogy is observed in a critical window around the critical 
point, which shrinks as the system size increases (see [12] 
for more details and for a way of keeping alive stylized 
facts in infinite systems). Therefore, the stylized facts of 
this model cannot be studied by the methods of statisti- 
cal mechanics. 

While the ability of the GCMG to produce stylized 
facts was emphasised in previous work, Ref. [9] pointed 
out two delicate problems of GCMG regarding stylized 
facts. The first one is the dependence of stylized facts 
on initial conditions: assume that one given realization 
of the game produces stylized facts; changing slightly the 
initial conditions yi(0) is then enough to destroy them, 
leading to scenarios with only Gaussian price changes. 
The second problem is the following: for a given set of 
parameters, one realization of the game may produce 
stylized facts but another not. Both problems are due 
to the coincidence of fixed disorder in the strategies and 
infinite score keeping. This ultimately motivates the in- 
troduction of finite score memory. Fig. 1 compares A(t) 
for the same game with yi(0) ^ 0, once with A = and 
once with A > 0. In the latter case, fluctuations have a 
characteristic pattern as a function of time: they first de- 
crease to a very small value, stay at this level for a time 
interval of order 1/A and then increase, producing fat- 
tailed price returns. The same kind of volatility behavior 
occurs in the original MG (see Appendix A). 

However, finite score memory is a double-edged sword. 
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FIG. 2: Absolute- valued price return auto-correlation func- 
tion for increasing A for a given realisation of an on-line game; 
inset: same data in double-linear scale (10 7 iterations per run, 
after 200P iterations; P = 20, n s = 20, n p = 1, e = 0.01) 

First of all, certain empirical stylized facts, in particu- 
lar 'volatility clustering', suggest the presence of long- 
memory effects in markets. If one defines volatility clus- 
tering via the requirement 

(\A(t)\\A(t + r)\) ^ 
C(T) - (A{tf) (6) 

it is easily understood that any positive value of A de- 
stroys this power-law dependence, because of the cut-off 
that it imposes at times of the order of P/\ ln(l — A)|. Fig. 
2, by displaying C(r) for r < 1000, is not able to show 
this effect for A < 0.001; the cut-off is clear for larger A. 
Not only C(r) is cut off, but the loss of memory induces 
a negative C(r) at large r (see inset). 

The other potentially nefarious effect of finite score 
memory is threaten the power-law tails: GCMGs pro- 
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FIG. 3: Average price return distribution function of on-line 
games for increasing A. Average of 100 samples on 10 6 itera- 
tions per run, after 200P iterations; P — 20, n 3 = 20, n p — 1, 
e = 0.01. 



4 




FIG. 4: Average price return distribution function of on-line 
games for increasing A. Average of 100 samples on 10 6 itera- 
tions per run, after 200P iterations; P — 20, n s = 40, n p = 1, 
e = 0.01. 



duce power-law tailed A because of a volatility feed-back 
[12]. This feed-back needs some time to establish, hence 
if the memory length associated to A is smaller than this 
time, power-law tails should disappear. This is indeed 
the case (see Figs 3 and 4), in a queer way: the central 
part of P{A) is exponential, but the size of the support 
of P(A) actually increases because of the appearance of 
two peaks. Note that maximum value A* of A for which 
stylized facts are preserved depends on the system's pa- 
rameter, as shown by these two figures. 

We conclude that, in practice, a sufficiently small 
A > A* ~ 0.001 preserves the salient stylized facts: the 
noise of C(r) for r > 100 in GCMG or in financial market 
data is such that the values of A that preserve power-law 
tails of P(A) do so for C(l). Therefore, the introduction 
of finite score memory does not affect significantly the 
market-like phenomenology produced by GCMG. The 
value of A* obtained has to be contrasted with the small 
typical time-window used e.g. in [10, 11]. 

Let us now investigate the effect of updating the rii (t) 
every T time steps (see Fig. 5). Increasing T, one inter- 
polates between on-line games (T = 1) and batch games 
(T ^> P). One clearly sees that this destroys large price 
changes: while P(A) has power-law tails when T = 1, it 
is gradually transformed into an exponential distribution 
for T 3> P- This is because large price changes is associ- 
ated to a given pattern \i [9, 11] that varies as a function 
of time, as illustrated by Fig. 6. However, it should be 
noted that T ~ P gives rise to a cleaner power-law. 

Batch games display the same kind of transition when 
A is increased. Fig. 7 shows that the largest value of A 
such that P\ (A) ~ Po(A) is smaller and around 0.002 for 
n s = 20, P = 20. Interestingly, the distribution of price 
changes P(A) is more or less stable with respect to A as 
long as A < 0.004 in this figure. This is an important 
condition for the use of finite memory in these models. 




FIG. 5: Average price return distribution function for increas- 
ing T. Average of 100 samples on 1,000,000 iterations per run, 
after 200P iterations; P = 20, n s = 40, n p = 1, e = 0.01. 
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FIG. 6: Price return as a function of time for various patterns 
fi. A symbol is plotted only when fj,(t) = fj.. 



IV. STATIONARY STATE WITH A = 

This section characterizes the steady state of both the 
on-line (Eq. (2)) and batch GCMG (Eq. (3)) for A = 0. 
The relevant macroscopic observables are as usual the 
'volatility' a 2 and the 'predictability' H, given respec- 
tively by 



(A 2 ) 



H 



{AW 



P 



(7) 



where (• • •) and (• • • \ denote time averages in the sta- 
tionary state, the latter conditioned on the occurrence 
of the piece of information /i, and the over-line denotes 
an average over information patterns. In on-line games, 
these two quantites are linked by 



H 



G 



(8) 
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FIG. 7: Average price return distribution function of batch 
games for increasing A. Average of 100 samples on 1,000,000 
batch iterations per run, after 200P batch iterations; P — 20, 
n s = 20, n„ = 1, e = 0.01. 



where G = J2?=i 4>i/ p , <A 



denoting the probabil- 



ity that speculator i joins the market in the steady state. 
The normalizing P factors have been introduced in or- 
der to ensure that all quantities remain finite when P, 
In addition to these, an important role is 



N„, N, 



p 



played by the relative number of active speculators 



i=l 



(9) 



as well as by the number of active speculators per pat- 
tern, i.e. n act = n s 4>. 



A. On-line GCMG: static approach 

Partial results from replica-based calculus are reported 
in Ref. [12], and are based on the existence of a global 
quantity H e that is minimized by the dynamics. Finding 
H e relies on the prescription given in Ref. [22]: first 
derive the continuous-time stochastic differential version 
of Eq. (2), which reads 



j/i(r) = -ai(n t A) - e + 77,(7-) 



(10) 



where r = f/P is the intrisic time of the GCMG, 771 (t) 
is a zero-average Gaussian noise with {T]i{T)r)j(r')) = 
jjaiaj(A 2 ) y 5(T— r'), and the over-line denotes an average 



over information patterns: 



= (iap)e; =1 



The 



deterministic term of this equation can be interpreted 
as the gradient of P e , so that H e = H + 2eJ^ i _f 1 <^j. 
Stationary states correspond then to the minima of H c . 
When the stationary state is unique, its equilibrium prop- 
erties are entirely determined by H e ; otherwise, one has 
to supplement H e with a self-consistent equation for com- 
puting correlations [22]. Regarding H c as a cost function, 



one may compute the minima of H e from the partition 
function Z{(3) — Tr e~P Hc . The typical properties of 
the minimum of H e , i.e. of the j3 — * 00 limit of free en- 
ergy, require the evaluation of the quenched-disorder av- 
erage [logZ] dis , which is performed via the replica trick 
[logZ] dis =lim n ^ log[(Z")] dis In. 

Reference [12] reported plots of the exact solution. 
Here we give the final results of the calculus only, as 
the latter is standard (see however [16] for more de- 
tails). With replica symmetric ansatz, the free energy 
/ = {1/09) In Z/d(3, which corresponds to H e in the limit 
— ► 00, is given by 



f(g, r ) = log 



1 



20(G-g) 
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1 + X 



2 (RG - rg) - ±(log J\n e-^}ll) 



a0 



where we found it convenient to define the "potential" 



V z (ir) 



a(i{R - r) 



ar z 7r + 2e7r 



(12) 



so that the last term of / looks like the free energy of a 
particle in the interval [0, 1] with potential V z (tt) where z 
plays the role of disorder. G = 2j=i 4>i is the self-overlap 
and (7, R, r are Lagrange multipliers. 

The four saddle point equations have exactly the same 
form as MG without the strategy: 



dj_ 
dg 
df_ 
dG 
5/ 
OR 
df 
dr 



4(p + g) 
0{R~r) = - 



In the limit /3 - 
— * G and r — * R. 

W{G - g) 

X = : 

a 



(13) 
(14) 

(15) 
(16) 



> we can look for a solution with 
It is convenient to define 



d(G - g) = 



and C = -J-P(R-r) (17) 



and to require that they stay finite in the limit (3 — > 00. 
The potential can then be rewritten as 



(18) 



The averages are easily evaluated since, in this case, 
they are dominated by the minimum of the potential 
V z (ir). Let K be e(l + x)i the minimum of V^(7r) is at 
7T = for z < C K and at tt = +1 for z > C(l + K). For 
C K < z < ([1 + K], the minimum is at tt — z/( — K. 
With this we find, 

((ttz)) = 1 {erf[(l + K)(/V2] - erf^C/^)} (19) 
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and 



= G = 



(jf.lje-Wt'/J.jfe-JfV/J 

(20) 



B. Batch GCMG: dynamical approach 

We consider the batch-GCMG dynamics, which we re- 
cast as 



+ -crfc[(l + X)C/V2] 

The fraction of agents who never enter into the market 
is then 0o = (l+erf(£lf/\/2)/2, and the ones who always 
participate is fa = erfc[(l + if )C/V2]/2. The pdf of the 
-Ki is given by 



N 

yi(t + 1) = yi{t) - ^2 J ij n 3 

3 = 1 



(t) 



hi(t) (26) 



V(ir) = fa8{Tr) + faS(Tr - 1) + _^ e -(C(-+^)) 2 /2 



2tt 



And the average number of agents in the market 
where the average is over V(n), is 



(21) 

= <7T> 



h + 



7= 



C 2 /2 



-C 2 {l+K) 2 /2 ' 



CV27T 



+ _ ( cr f(^ C /V2) - erf [(1 + Jf)C/V^]) 



Observing that £ = y / a/(p + G), one finally finds that ( 
is fixed as a function of a and p by the equation 



a 

= P 



, _ (^-l)c-( 1+ ^ 2 / 2 -^c- x2z2 / 2 
+ i + (erf [(1 + ^)C/V2 - erf (K(/V2 
+ ierfc[(l + if)C/V2] 



(22) 



which has to be solved numerically. With some more 
algebra, one easily finds : 



K = e 



1 - 



erf [(1 + K)Q/s/2] - erf {K^/s/2) 



2a 



(23) 



These two last equations form a close non-linear set of 
equations. 

The calculus gives finally H t = lim^o^ 



n p + n s G((,K) 

(i + x) 2 



2efaK,0 



(24) 



The stationary is unique if H e ^ 0, which is the case 
as long as e ^ and for n s < n* s (n p , e) if e = 0. 
The fluctuations are given by 



2 2 n P + n sG . 

<J = e f- - G). 



7f 2 



(25) 



One can also show that H € oc e 2 for small e: according 
to Eq. (24), this hold if K — > < if < co, which can 
be seen by expanding Eq. (23) in powers of e. 



with 



£i = 



e for 1 < i < N s 

-co for iV s + 1 < i < N s + JV p = TV 



The subscripts s and p denote speculators and producers, 
respectively. The relevant dynamical variable is rii(t) = 
Q[yi(t)] (for producers, = 1), while the random 

couplings Jij are given by (4), = (1/AT) ^ of , 
with a^ 1 G { — 1,1} iid quenched random variables with 
uniform probability distribution. For simplicity, we set 



_ P _ 



1 



(27) 



with n s — N s /P and n p — N p /P. The external sources 
hi(t) have been added in order to probe the system 
against small perturbations. In the following, we denote 
averages over all possible time-evolutions (paths), i.e. re- 
alizations of (26), by double brackets ((• • •)). 

The standard tool for investigating the dynamics of 
statistical systems with quenched disorder is the path- 
integral method a la De Dominicis, based on the evalua- 
tion of the generating functional 



exp 



(28) 



dis 



from which disorder-averaged site-dependent correlation 
functions of all orders can be derived via such identities 



as 



[«ni(*)»] 



dis 



i lim 



dZ[i/>] 
<A"b dipi{t) 

d 2 z[ip] 



(29) 
(30) 



In turn, macroscopic (auto)-correlation and response 
functions like 



C(t,t')= lim l^[((^(tK(t')»]d 



i=l 
N 



G(t,t') = lim 1 



i=l 



dhi(t') 



(31) 
(32) 



can in principle be evaluated by simply taking derivatives 
of Z with respect to the sources {ipi, hi}. The calculation 
of Z in the limit N — > oo leads, via a saddle-point inte- 
gration, to the identification of a non-Markovian single 
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('effective') agent process that provides a complete de- 
scription of the original Markovian multi-agent process 
(26). This procedure requires a straightforward varia- 
tion of the cases dealt with in the existing literature and 
we will not report it in detail. For our purposes, it will 
suffice to say that the effective dynamics for speculators 
is given by 

y(t + 1) = y(t) - a £(J + G)-\t, t')n{t') 



t'<t 



ae + h(t) + sfaz{t) (33) 



with n(t) = Q[y(t)], whereas n(t) = 1 for producers al- 
ways. Here, / is the identity matrix, G is the response 
function (32), and z(t) is a Gaussian noise with zero av- 
erage and time correlations 

(z(t)z(t')) = [{I + G)-'C{I + G T )-\t,t>) (34) 

with C the correlation function (31). Notice that the 
coupling between the two groups is provided in essence 
by the noise term, since both speculators and producers 
contribute to C. In fact, the correlation function can be 
written as 



C{t,t') = n s aC s {t,t') + n p a 



(35) 



where n 8 a (resp. n p a) is the fraction of speculators (resp. 
producers), and C s (resp. C p ) denote the correlation 
function of speculators (resp. producers). For the re- 
sponse function we have, similarly, 

G(t,t') = n s aG s (t,t') +n p aG p (t,t') 

= n s aG s (t,t') (36) 

producers being 'frozen' at n — 1 and thus insensitive to 
small perturbations. 

Assuming time-translation invariance, 



lim C(t + r,t) = C(t) 

t— >O0 

lim G(t + T,t) = G(t) 
finite integrated response, 

lim VG(i,(') < oo 

t'<t 

and weak long-term memory, 

lim G(t, t') = W finite 



(37) 
(38) 



(39) 



(40) 



crgodic steady states of (33) can be characterized in 
terms of a couple of order parameters, namely the per- 
sistent autocorrelation 



c= lim -VC(r) 

r — >-oo t ' J 



(41) 



and the integrated response (or susceptibility) 
X = lim ]TG(t) 



From (35) and (36) we get 



c = n s ac s + n p a 
X = n s axs 



(42) 



(43) 
(44) 



where c s and Xs are the persistent autocorrelation and 
susceptibility of speculators. For A = one can formu- 
late, inspired by computer experiments, a simple Ansatz 
for the dynamics of scores that allows to calculate these 
quantities exactly as functions of c and x> so that from 
(43) and (44) one may retrieve the values of the persis- 
tent order parameters for any n s and n p . In this case, 
ergodicity breaks down as x diverges at certain critical 
values of the parameters, thus violating (39). This analy- 
sis, including the dynamical phase transition, reproduces 
the phenomenology of the batch-GCMG without memory 
remarkably well, at least in the ergodic regime. 

The score y(t) of speculators either grows linearly with 
time as y(t) ~ « t (in which case the agent is 'frozen' 
at inactivity with n{t) = if v < or activity with 
n(t) = 1 if v > 0), or keeps oscillating about y(t) = 
(in which case the agent is 'fickle') [23]. In order to 
distinguish between the two situations, we introduce the 
variable y(t) — y(t)/t. Using this, we can sum (33) over 
time to obtain 



1 



y(t + 1) - -y(l) 



t't" 



-ae+^^>(i') (45) 



t 

V 

where n(t) = Q[y(t)]. In the limit t — > oo, the above 
leads, via (37-40), to a simple equation for the quantity 
y = lim t _>ooJ/(*): 

y = — ae + ^/az (46) 



1 + X 



where x is given by (42), n = lim r ^ 00 (l/T) J2 t < T 
and z = lim T ^ 00 (l/r) J2t< T 1S a zero-average Gaus- 
sian rv with variance 



(47) 



Defining 7 = <*/a/(l + %), we can proceed as usual 
by separating the frozen speculators from the fickle ones. 
We have the following situation: for y > the effective 
speculator is always active (n = 1) and z > 7 + ^/ae; for 
y < the effective speculator is always inactive (n = 0) 
and z < y/ae; for y = the effective speculator is fickle, 

n = z ~^" e and ^fae < z < 7 + ^fae. So we have 
c s = (6(z-7 - v/ae)) 

+ (Q(z - V^e)6(7 + V^e - z) ( j ~ ) (48) 
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with brackets denoting an average over z. For Xs, we use 
the fact that the noise z(t) formally acts like an external 
source in (46), so that Xs — (l/v / ^)(fr)- This gives 



(8(2 - Vae)©(7 + Vae - z)) 



(49) 



One can also calculate the average activity level of spec- 
ulators as 



= {n) = <6(z-7-V^e)) + 

(Q(z - Vae)9(7 + V7?e - z){- 
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)) (50) 



from which the number of active speculators per pattern 
follows as n act = n s 4>, and the fraction of frozen specula- 
tors as 

/ = (0(2 - 7 - V5e)) + (0(ae - z)) =h + h (51) 

where /i (resp. /o) stands for the fraction of always 
active (resp. inactive) speculators. Inserting (48) and 
(49) into (43) and (44) one obtains two equations that can 
be solved self-consistently for c and x- These equations 
can be analyzed for any e. For the sake of simplicity, we 
focus on the case e = 0, in which the averages over z take 
a particularly simple form. We have 



c = n p a + n s a 




c 

27TQ 



1 + X 



whereas 6 reads 



c 

2twl 



(54) 



As a quick consistency check, one can see, starting 
from (53) and with minor manipulations, that a diver- 
gence of the susceptibility (i.e. the violation of (39) 
with consequent ergodicity breaking) for n p = 1 oc- 
curs at n s — n* = 2/crf(£*) where £* is the solution 
of the transcendental equation = £\/7r. The result, 
n* = 4.14542..., is in full agreement with the replica 
results of both the on-line and the batch-model. 

In order to compare with the computer experiments 
discussed above, we analyze the fraction of active spec- 
ulators and the volatility a 1 obtained from (52-53) at 
rip = 1. As said above, the former is just n ac t = n s <j). As 
for the latter, it is formally given by 



= lim (z(t)z(t))/a 

t — »CO 



(55) 




FIG. 8: GCMG with A = 0: number of active speculators per 
pattern n act = n a (j> (top) and volatility (bottom) at n v = 1 
as a function of n a for different values of e. The dashed ver- 
tical line marks the position of the critical point n* . Markers 
denote results from computer experiments. 



It is possible to derive an approximate expression for the 
above limit in terms of persistent order parameters as- 
suming that the retarded self-interaction of fickle spec- 
ulators is negligible, that is, by neglecting the agent's 
auto-correlation. This leads to [5] 

2 rip + njx 1 

i + xr 4 



Comparing with the replica result, one sees that a 2 com- 
prises a Hq term, as expected from the relationship (25). 
However, the other terms differ, as in the standard MG. 

(52) In Fig. 8 we report the behavior of <f> and a 2 obtained 
from (52-53) at n p = 1. Dynamical results for the 
batch model are in excellent agreement with the simu- 
lations of the batch-GCMG and reproduce qualitatively 

(53) t ne phenomenology of the on-line-GCMG. For the sake 
of completeness, we also report (see Fig. 9), for e = 0, 
the critical line n*(n p ) where x diverges and ergodicity 
breaks down. After some algebra, it turns out to be given 



15 
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FIG. 9: GCMG with A = 0: critical line n* 
X diverges and (39) is violated at e = 0. The dynamics is 
ergodic for n < n*. 
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FIG. 10: 4>\ and (f>o as a function of time for games with A = 
(black lines), 0.01 (red lines) and 0.1 (green lines). P — 100, 
n s = 4, e = 0.05 



by n* s (n p ) = 2/erf(£*), where £* = C( n p) is tne solution 
of 



-S 2 



(rip - l)erf(C) + 1 



(57) 



V. STATIONARY STATE WITH A > 

One may expect that the stationary state of Minority 
Games with A > depends smoothly on A in ergodic 
regions, and indeed some important quantities such as 
the fluctuations and H do behave this way. More sur- 
prising is the vanishing of frozen agents: Fig. 10 reports 
that both 4>o and 4>i seem to cancel for any positive A, 
although this may not appear for finite-time simulations 
at small values of A; the measure was done from t = 1000 
and counts the fraction of agents that are never out of 
the market and in the market, respectively. 



A > 0, the dynamics is ergodic, and accordingly the sta- 
tionary state is unique. This would be compatible with 
a minimized quantity that is not cancelled anymore by 
the dynamics; intuitively, this means that a new term is 
added to H e . Unfortunately, finding such function turns 
out to be impossible in this case, which rules out the use 
of replica-based approach. 



B. Batch GCMG: dynamical approach 

Let us now turn to the effective process (33) with finite 
score memory. As discussed above, in this case scores do 
not diverge with time, i.e. limt_, 00 y(t) < oo, and it is no 
longer possible to separate frozen agents from fickle ones 
by the use of the quantity y = lim^oo y(t)/t. Indeed, 
proceeding as done for the case A = 0, one obtains, in 
place of (46), the condition 



A lim 

t— »oo t 



i 



ae 



'az 



x 



(58) 



where the term on the l.h.s. is finite. In addition, the fact 
that the fraction of frozen players undergoes an extremely 
slow dynamics ultimately suggesting that all agents are 
fickle in the stationary state indicates the necessity of a 
different analytical approach when A =/= 0. Unfortunately, 
have been unable to capture the peculiarities of the score 
dynamics with a simple Ansatz. 

Some general hints can be obtained by calculating ex- 
plicitly the first time step of (33). For simplicity, we 
henceforth adopt the shorthand (z(t)z(t')} = L(t,t') for 
(34). Furthermore, we assume an initial condition y(0) 
for (33) ensuring that all speculators are active at time 
0, so that C(0, 0) = 1 and (by causality) G(0, 0) = and 
L(0, 0) = 1. The transition probability to pass from y(0) 
to j/(l) is given by 



p[y(l)\y(0)] = ^=e t 
Vet 



i r [y(l)-(l-A)y(0)-e(0)+Qn(0)+Qe] 2 

(59) 



As a consequence, we have 



A. On-line GCMG: static approach 

In many versions of the minority game with infinite 
score memory (A = 0) and naive agents, a phase tran- 
sition takes place. H behaves like a physical order pa- 
rameter which is minimized by the dynamics. The latter 
is ergodic as long as H > 0. However, when H = 0, 
the stationary state is note unique, and the dynamics 
becomes non-ergodic: the stationary state is selected by 
the initial score valuation ja(0); this happens in partic- 
ular in the original MG and in the present GCMG with 
e = and A = 0. On the other hand, it is obvious that 
if yi(0) is gradually forgotten, the stationary state can- 
not depend anymore on Ui(0). This is precisely what 
the introduction of the finite score memory does: when 



C(1,0)= / dy(l)dy(0)p[y(l)\y(0)} P (y(0))n(l)n(0) 



1-erf 

_ 1 

_ 2 



cm(0) + ae - h(0) - (1 - A)y(0) 



1-erf 



V2a 

a + ae - (1 - A)y(0) 



2a 



(60) 



where we set h(0) = and used the fact that n(0) = 1, 
and 



G(1,0) 



d 



dh(0) 



dy(l)dy(0)p[y(l)\y(0)} P (y(0))n(l) 



^=exp{-i- [a + ae -(l-A),(0)] 2 } (61) 
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From these, we can evaluate (recalling that in the steady 
state (55) holds) the time-dependent volatilities 

L(1,0) = £(/ + G)~ 1 (l,t)C(t, t')(I + G T y 1 {t\ 0) 
t,t' 

= -G(l,0) + G(l,0) (62) 

and 

L(l, 1) = £(J + G) -1 (l, t)C(t, t')(I + G T ) _1 (t', 1) 



t,v 



= G(l, 0) 2 - 2C(1, 0)G(1, 0) + (7(1, 0) (63) 

From the above formulae we see that if the initial condi- 
tion is large, in particular for y(0) 2> y/a, we have 



lim (7(1,0) = 1 

n s — >-oo 

lim L(1,0) = 1 

n s — >oo 

for < A < 1, and 

lim (7(1,0) = 
lim L(1,0) = 



lim G(1,0) 

l s — >oo 

lim L(l,l) = 



Tlx, — *OQ 



n s — >oc 



lim G(1,0) = 

i s — >oc 

lim L(l,l) = 1 



n s — >oo 



(64) 
(65) 

(66) 
(67) 



for A > 1, provided n p is finite. The latter limits indicate 
that, as is to be expected, for A > 1 the agent de-activates 
immediately after the first time step and starts being ac- 
tive and inactive alternatively. The former limits imply 
instead that the effective agent continues to play. In par- 
ticular, as long as he's playing, he's insensitive to small 
perturbations, so that 



l/(t)=i(l-A)i/(0) + tVaz(0) 



(68) 



Hence one sees that the de-freezing occurs for times of 
the order of (1 — X)/y/a. 

In order to obtain a deeper insight on the station- 
ary states of (33), we now turn to a different approach, 
namely the Eissfeller-Opper method. In a nutshell, the 
idea is to simulate many copies of the effective dynamics 
and calculate relevant physical observables as averages 
over the whole population. The core of the procedure 
lies in the possibility of evaluating the response function 
without actually adding an external probing field. In 
fact, for G{t,t'), which is formally given by ( ) ' one 
can again resort to the noise and write 



s/a oz[v) 



Now the noise distribution P(z) is 



P(z) ~ exp 



\Y,<t)L-\tJ)z{t') 



(69) 



(70) 
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FIG. 11: GCMG with different values of A: numerical solution 
of the effective dynamics for M = 250000 copies with e = 
—0.1. Markers correspond to the volatility as a function of 
time for n s =0.1 (circles), 1 (squares), 2 (diamonds), 4 (up 
triangles) , 8 (down triangles) . Only a few markers are shown 
for simplicity. 



so that, after an integration by parts, one gets 
t-i 

G(t,t') = *£(n(t)z(t"))L- l (t",t') 

t"=0 

t-i 

= £ K(M")£- x (fV) (71) 



The matrix K, as well as the correlation function G, can 
be evaluated by an average over the copies (say, M) of 
the effective dynamics: 



i M 

C(t,t') = -J2Mt)nt(t') 



(72) 
(73) 



i=i 



and the only remaining problem is that of generating 
a noise z(t) having the desired statistical properties. 
This can be done by properly summing and re-scaling 
unit Gaussian variables. We focused again on the time- 
evolution of the quantity L(t,t), whose limit t — > oo is 
linked to the volatility (see (55)). Results for different 
A's and n s at n p = 1 are shown in Fig. (11). One sees 
that finite memory generally increases fluctuations. The 
effect is more pronounced for large values of n s . 



VI. CONCLUSIONS 

To summarzie, we have studied the effects induced by 
a finite score memory in MG-based market models. Our 
main result is that finite score memory does not destroy 
market-like phenomenology in grand canonical minority 
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games, and remedies two embarassing problems of minor- References 
ity games with a fixed set of strategies. From this point 
of view, implementing finite scores is a useful extension 
of GCMG, but cannot be understood fully by current 
analytical methods. 
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APPENDIX A: ORIGINAL MG WITH A > 0. 

The definition of the original MG is similar to that of 
the GCMG, except that all the agents have at least two 
strategies, and one score per strategy. Assuming that 
they have only two strategies, denoted a^i and for 
agent i, each having their respective score y^i and y^- 
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100000 200000 
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FIG. 12: The same realization of the standard minority game 
with unbiased initial strategy scores and A = (circles), bi- 
ased initial scores and A = (squares), biased initial scores 
and A = 0.1 (triangles). (P = 100, a = 0.1) 



The latter evolve according to 

yUt + 1) = yi (t)(l - X/P) - a^ ] A{t) (Al) 

for the on-line MG, and 

y i , s {t + l) = y i {t){l-\)-a^A{t) (A2) 

for the batch MG. At each time step, the agents use 
their strategy that has the highest score. The original 
MG was defined with A = 0, and the usual control pa- 
rameter is a = P/N . There is a phase transition at 
a c = 0.3374 . . . that separates an asymmetric, informa- 
tion rich phase with H > (a > a c ) from a symmetric 
phase with H = (a < a c ). In the latter, the stationary 
state depends on the asymmetry 

U = U itl (0)-Ui,2(0) 

of the initial conditions. This dependence is due to the 
fact that, if A = 0,, all market fluctuations since time t = 
are remembered and contribute with the same weight 
to Ui, s (t), irrespective of how far in the past they took 
place. 

Fig. 12 shows that a time dependent a 2 = (A 2 ) t (here, 
(. . .)( stands for an average over a long but finite time 
interval around t). converges, for long times, to a value 
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which is independent of initial conditions Uq. This im- 
plies that the initial asymmetry Uq has no influence any 
more on the stationary state. 
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FIG. 13: Average volatility a 2 /N = (A 2 )/N versus a = P/N 
of the standard MG with unbiased initial scores and A = 
(circles), A = 0.1 (squares), A = 0.2 (diamonds), with biased 
initial scores and A = 0.2 (triangles). 
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FIG. 14: a 2 /N (black lines) and H/N (red lines) for a real- 
ization of the standard Minority Game with increasing A 



This is confirmed by Fig. 13, where we plot a 2 /N 
as a function of a for various Uq and A. For a fixed 
a and for a given realization of the game, a 2 and H 
are increasing functions of A, although in the symmetric 
phase (a < a c — 0.3374 . . . ) a 2 varies very slowly with A 
(see Fig. 14). While only small values of A make physical 
sense, for completeness the same figure reports also the 
strange effects of very large A. From (Al), A = P cancels 
the contribution of Z7j(t) in [/,(£+ 1); larger A implies that 
this contribution is of opposite sign to the usual MG. 



